04:00
2026-06-03
arxiv.org
large-language-models
Hallucination Is Linearly Decodable from Mid-Layer Hidden States in Quantized LLMs
Researchers found that a linear probe applied to mid-layer hidden states of quantized large language models can detect hallucinations with an AUROC of up to 1.000, significantly outperforming sampling-based…